Let $I$ be an independent set drawn from the discrete $d$-dimensional hypercube $Q_d = \{0,1\}^d$ according to the hard-core distribution with parameter $\lambda > 0$ (that is, the distribution in which each independent set $I$ is chosen with probability proportional to $\lambda^{|I|}$). We show a sharp transition around $\lambda = 1$ in the appearance of $I$: for $\lambda > 1$, $\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} = 0$ asymptotically almost surely, where $\mathcal{E}$ and $\mathcal{O}$ are the bipartition classes of $Q_d$, whereas for $\lambda < 1$, $\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\}$ is asymptotically almost surely exponential in $d$. The transition occurs in an interval whose length is of order $1/d$.

A key step in the proof is an estimation of $Z_\lambda(Q_d)$, the sum over independent sets in $Q_d$ with each set $I$ given weight $\lambda^{|I|}$ (a.k.a. the hard-core partition function). We obtain the asymptotics of $Z_\lambda(Q_d)$ for $\lambda > \sqrt{2} - 1$, and nearly matching upper and lower bounds for $\lambda \le \sqrt{2} - 1$, extending work of Korshunov and Sapozhenko. These bounds allow us to read off some very specific information about the structure of an independent set drawn according to the hard-core distribution.

We also derive a long-range influence result. For all fixed $\lambda > 0$, if $I$ is chosen from the independent sets of $Q_d$ according to the hard-core distribution with parameter $\lambda$, conditioned on a particular $v \in \mathcal{E}$ being in $I$, then the probability that another vertex $w$ is in $I$ is $o(1)$ for $w \in \mathcal{O}$ but $\Omega(1)$ for $w \in \mathcal{E}$.

1 Introduction and statement of results

The focus of this paper is the discrete hypercube $Q_d$. This is the graph on vertex set $V = \{0,1\}^d$ with two strings adjacent if they differ on exactly one coordinate. It is a $d$-regular bipartite graph with bipartition classes $\mathcal{E}$ and $\mathcal{O}$, where $\mathcal{E}$ is the set of vertices with an even number of 1's. Note that $|\mathcal{E}| = |\mathcal{O}| = 2^{d-1}$. (For graph theory basics, see e.g. [1].)

An independent set in $Q_d$ is a set of vertices no two of which are adjacent. Write $\mathcal{I}(Q_d)$ for the set of independent sets in $Q_d$. The hard-core model with parameter $\lambda$ on $Q_d$ (abbreviated ${\rm hc}(\lambda)$) is the probability distribution on $\mathcal{I}(Q_d)$ in which each $I$ is chosen with probability proportional to $\lambda^{|I|}$. This fundamental statistical physics model interprets vertices of a graph (in this case, $Q_d$) as sites that may or may not be occupied by massive particles, and edges as bonds between pairs of sites (encoding, for example, spatial proximity). The occupation rule is that bonded sites may not be simultaneously occupied, so a legal configuration of particles corresponds to an independent set in the graph. In this context $\lambda$ represents a density parameter, with larger $\lambda$ favouring denser configurations. (For an introduction to the hard-core model from a combinatorial perspective, see for example [2].)

In [8], Kahn used entropy methods to make an extensive study of the hard-core model on $Q_d$ (and regular bipartite graphs in general) for fixed $\lambda > 0$. One of the main results is that an independent set from $Q_d$ chosen according to ${\rm hc}(\lambda)$ exhibits phase coexistence: it comes either predominantly from $\mathcal{E}$ or predominantly from $\mathcal{O}$. Specifically, it is shown in [8] that for fixed $\lambda, \varepsilon > 0$, and for $I$ chosen from $\mathcal{I}(Q_d)$ according to ${\rm hc}(\lambda)$, both of

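$$\left| |I| - \frac{\lambda 2^{d-1}}{1+\lambda} \right| \le \frac{2^d}{d^{1-\varepsilon}} \qquad \text{and} \qquad \min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} \le \frac{2^d}{d^{1/2-\varepsilon}}$$
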

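hold asymptotically almost surely (a.a.s.), that is, with probability tending to 1 as $d \to \infty$. Informally, the work of [8] demonstrates that for all fixed $\lambda > 0$, ${\rm hc}(\lambda)$ is close to $\frac{1}{2}\mu_{\mathcal{E}} + \frac{1}{2}\mu_{\mathcal{O}}$ where $\mu_{\mathcal{E}}$ (or $\mu_{\mathcal{O}}$) is a random subset of $\mathcal{E}$ (or $\mathcal{O}$) in which each vertex is chosen to be in the set independently with probability $\frac{\lambda}{1+\lambda}$. (This is just ${\rm hc}(\lambda)$ on $\mathcal{E}$ (or $\mathcal{O}$).)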

Kahn's estimates on $|I|$ and $\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\}$ do not involve $\lambda$. Here we are able to obtain more precise estimates that capture the dependence on $\lambda$ and in particular show that $\lambda = 1$ is a critical value around which a transition occurs in the nature of the phase coexistence: for $\lambda > 1$, the smaller of $|I \cap \mathcal{E}|$, $|I \cap \mathcal{O}|$ is a.a.s. 0, whereas for $\lambda < 1$, it is a.a.s. exponential in $d$. Allowing $\lambda$ to vary with $d$, we find that the transition between $\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\}$ being a.a.s. 0 and a.a.s. going to infinity with $d$ occurs in an interval of length order $1/d$.

To state our results precisely we consider four possible ranges of $\lambda$:

$$\lambda \ge 1 + \frac{\omega(1)}{d} \qquad (1)$$

$$|\lambda - 1| \le \frac{O(1)}{d} \qquad (2)$$

$$\sqrt{2} - 1 + \frac{(\sqrt{2} + \Omega(1))\log d}{d} \le \lambda \le 1 - \frac{\omega(1)}{d} \qquad (3)$$

$$\frac{c \log d}{d^{1/3}} \le \lambda \le \sqrt{2} - 1 + \frac{(\sqrt{2} + o(1))\log d}{d} \qquad (4)$$

where $c > 0$ is an absolute constant (that we do not explicitly compute). Here and in what follows, $\omega(1)$ indicates a function of $d$ that tends to infinity as $d$ does; $o(1)$ a function that tends to 0; $\Omega(1)$ a function that is eventually always greater than some constant greater than 0; and $O(1)$ a function that is bounded above by a constant. All implied constants will be independent of $d$, all limiting statements are as $d \to \infty$, and where we are not taking a limit, we will always assume that $d$ is large enough to support our assertions. Unless otherwise indicated, all logarithms are to base $e$.

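Theorem 1.1 Let $I$ be chosen from $\mathcal{I}(Q_d)$ according to ${\rm hc}(\lambda)$.

1. For $\lambda$ satisfying (1), a.a.s.

$$\left| \max\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} - \frac{\lambda 2^{d-1}}{1+\lambda} \right| \le 2^{d/2}\sqrt{\log d} \qquad (5)$$

and

$$\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} = 0. \qquad (6)$$

2. For $\lambda$ satisfying (2), a.a.s. (5) holds. If $\lambda = 1 + \frac{k + o(1)}{d}$ for some constant $k$ then for each $c \in \mathbb{N}$

$$\Pr\left(\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} = c\right) \sim \frac{\left(\frac{1}{2}e^{-k/2}\right)^c}{c!} \exp\left\{-\frac{1}{2}e^{-k/2}\right\}. \qquad (7)$$

3. For $\lambda$ satisfying (3), a.a.s. (5) holds, as well as

$$\frac{\left| \min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} - \frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \right|}{\sqrt{(2+\varepsilon)\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \log\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)}} \le 1 \qquad (8)$$

where $\varepsilon > 0$ is arbitrary.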
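4. For $\lambda$ satisfying (4), a.a.s.

$$\left| \max\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} - \frac{\lambda 2^{d-1}}{1+\lambda} \right| \le d(\log d)\left(\frac{2}{1+\lambda}\right)^d \qquad (9)$$

and

$$\frac{1}{4\log m}\,\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \le \min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} \le e m^2\,\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \qquad (10)$$

where $m = m(\lambda, d) \le d/\sqrt{\log d}$ is any natural number satisfying

$$(ed^2)^m \lambda^{m+1} (1+\lambda)^{2m(m+1)} \frac{2^d}{(1+\lambda)^{d(m+1)}} = o(1). \qquad (11)$$

The upper bound on $m$ in (11) helps make our analysis more tractable, and does not impose any serious restriction: for any $\lambda = \omega(\sqrt{\log d}/d)$, for example, $m$ can be taken to be $o(d/\sqrt{\log d})$.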

The following corollary of Theorem 1.1 is immediate.

Corollary 1.2 $\Pr\left(\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} = 0\right)$ goes from $1 - o(1)$ to $o(1)$ as $\lambda$ goes from $1 + \omega(1/d)$ to $1 - \omega(1/d)$.
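
As a numerical illustration of the transition in Corollary 1.2, the following sketch evaluates the limiting Poisson probabilities from part 2 of Theorem 1.1. It assumes the window parametrisation $\lambda = 1 + k/d$ and the Poisson parameter $\gamma_k = \frac{1}{2}e^{-k/2}$; the function names are ours and purely illustrative.

```python
import math

def gamma_k(k):
    """Poisson parameter gamma_k = (1/2) * exp(-k/2), for lambda = 1 + k/d."""
    return 0.5 * math.exp(-k / 2.0)

def limiting_pmf(k, c):
    """Limiting value of Pr(min{|I & E|, |I & O|} = c) inside the window."""
    g = gamma_k(k)
    return g ** c / math.factorial(c) * math.exp(-g)

if __name__ == "__main__":
    # Probability that the smaller side is empty, as k moves across the window.
    for k in (-10, -2, 0, 2, 10):
        print(k, limiting_pmf(k, 0))
```

As $k$ increases the probability of an empty smaller side tends to 1, and as $k$ decreases it tends to 0, in line with the corollary.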

For fixed $\lambda \le \sqrt{2} - 1$ we satisfy (11) by taking $m = [1/\log_2(1+\lambda)]$ and so combining (8) and (10) we also get the following corollary.

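Corollary 1.3 For all fixed $\lambda < 1$, there are constants $c(\lambda)$ and $C(\lambda)$ (independent of $d$) such that a.a.s.

$$c(\lambda)\left(\frac{2}{1+\lambda}\right)^d \le \min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} \le C(\lambda)\left(\frac{2}{1+\lambda}\right)^d.$$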



Our proof of Theorem 1.1 provides structural information about the smaller of $I \cap \mathcal{E}$ and $I \cap \mathcal{O}$ for $I$ chosen according to ${\rm hc}(\lambda)$. For simplicity, we state the following result for fixed $\lambda$ (not varying with $d$). Say that $A \subseteq \mathcal{E}$ (or $\mathcal{O}$) is 2-linked if $A \cup N(A)$ induces a connected subgraph, and note that each $A \subseteq \mathcal{E}$ (or $\mathcal{O}$) can be partitioned into 2-components, that is, maximal 2-linked subsets. Write $I_{\min}$ for the smaller of $I \cap \mathcal{E}$ and $I \cap \mathcal{O}$ (with an arbitrary choice being made if $|I \cap \mathcal{E}| = |I \cap \mathcal{O}|$).

Theorem 1.4 Fix $\lambda > 0$ and let $I$ be chosen from $\mathcal{I}(Q_d)$ according to ${\rm hc}(\lambda)$. The following statements are all true a.a.s.

1. If $\lambda > 1$ then $I_{\min} = \emptyset$.

2. If $\lambda = 1$ then $I_{\min}$ consists of $k$ 2-components, each of size 1, with $k$ being drawn from a Poisson distribution with parameter $1/2$.

3. If $1 > \lambda > \sqrt{2} - 1$, then $I_{\min}$ consists of $k$ 2-components, each of size 1, where $k$ satisfies



a / 2 y 

2 \l + \) 


£ \ 


<-4 (if. 




for any $\varepsilon > 0$.

4. If $2^{1/m} - 1 > \lambda > 2^{1/(m+1)} - 1$ for some integer $m \ge 2$, then $I_{\min}$ consists of $k$ 2-components, each of size at most $m$, where $k$ satisfies

$$\frac{1}{4\log m}\,\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \le k \le e m\,\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d.$$

As will be seen in our proof of Theorem 1.4, the fourth statement above is also true for $\lambda$ satisfying (4) as long as $m$ is chosen to satisfy (11).

Theorem 1.4 suggests that a sequence of threshold phenomena occur for independent sets chosen from $Q_d$ according to ${\rm hc}(\lambda)$. The one we exhibit is that as $\lambda$ passes across 1, $I_{\min}$ goes from being empty to consisting of exponentially many singleton 2-components (and nothing else), with these 2-components arriving (in a Poisson manner) in a window of width $1/d$. It is tempting to conjecture that for each $m \ge 2$, as $\lambda$ passes across $2^{1/m} - 1$, $I_{\min}$ goes from having exponentially many 2-components of size $m - 1$ (and smaller), and no 2-components of size $m$, to having exponentially many 2-components of size $m$ (and smaller), with these new 2-components arriving (in an appropriate Poisson manner) in a short threshold window. An interesting direction for future work would be to extend what we have done for $\lambda$ satisfying (1), (2) and (3), and determine, for $\lambda$ satisfying (4) (and smaller), the exact number of 2-components of size $k$ in $I_{\min}$ for each relevant $k$, and the exact nature of the transition across $2^{1/m} - 1$ for each $m \ge 2$.

To understand probabilities associated with the hard-core model, it is useful to understand the normalizing constant (or partition function)

$$Z_\lambda(Q_d) = \sum_{I \in \mathcal{I}(Q_d)} \lambda^{|I|}.$$

In the case $\lambda = 1$, this just counts the number of independent sets in $Q_d$. Motivated by the interpretation of independent sets as "codes of distance 2" over a binary alphabet, Korshunov and Sapozhenko [5] gave an asymptotic estimate in this case.
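
For small $d$ the partition function can be computed by brute force, which gives a concrete feel for the quantity being estimated. The sketch below is illustrative only: it encodes vertices of $Q_d$ as integers in $[0, 2^d)$ (an encoding chosen here, not taken from the text) and sums $\lambda^{|I|}$ over independent sets by backtracking. It is feasible only for very small $d$.

```python
def partition_function(d, lam):
    """Brute-force hard-core partition function Z_lam(Q_d) for small d."""
    n = 1 << d
    # nbr[v] is a bitmask of the d neighbours of v (flip one coordinate).
    nbr = [sum(1 << (v ^ (1 << i)) for i in range(d)) for v in range(n)]

    def rec(v, chosen):
        # Sum of lam^(number of vertices added from v onwards) over all
        # independent extensions of the vertex set encoded by `chosen`.
        if v == n:
            return 1
        total = rec(v + 1, chosen)                # v left unoccupied
        if not (nbr[v] & chosen):                 # v has no occupied neighbour
            total += lam * rec(v + 1, chosen | (1 << v))
        return total

    return rec(0, 0)

if __name__ == "__main__":
    # With lam = 1 this counts independent sets of Q_d.
    print([partition_function(d, 1) for d in range(1, 5)])
```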

Theorem 1.5 $|\mathcal{I}(Q_d)| \sim 2\sqrt{e}\, 2^{2^{d-1}}$ as $d \to \infty$.

The following theorem, which extends Theorem 1.5 to a wider range of $\lambda$, is the main tool in our approach to Theorem 1.1.

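Theorem 1.6

$$Z_\lambda(Q_d) = \begin{cases} (2 + o(1))(1+\lambda)^{2^{d-1}} & \text{if } \lambda \text{ satisfies (1)} \\ (2 + o(1))(1+\lambda)^{2^{d-1}} \exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right\} & \text{if } \lambda \text{ satisfies (2) or (3)} \\ (1+\lambda)^{2^{d-1}} \exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d (1 + o(1))\right\} & \text{if } \lambda \text{ satisfies (4).} \end{cases}$$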

We expect that the range of validity for the third of these estimates can be extended to $\lambda \ge (1 + \Omega(1))\log d/d$. Indeed, we already know that the lower bound is valid for this range of $\lambda$, since (25) and the second clause of (26), which combine to give the lower bound, both hold for $\lambda$ in this range. On the other hand, the upper bound would follow immediately from an extension of Lemma 3.10 to the range $\lambda \ge (1 + \Omega(1))\log d/d$.

The gap between the upper and lower bounds for $Z_\lambda(Q_d)$ for $\lambda$ satisfying (4) is the main obstacle to understanding more precisely the evolution of the ${\rm hc}(\lambda)$ independent set, as discussed after the statement of Theorem 1.4.



The phenomenon of phase coexistence is, unsurprisingly, accompanied by a long-range influence phenomenon. An independent set $I$ chosen from $\mathcal{I}(Q_d)$ according to ${\rm hc}(\lambda)$ is drawn (in the range of $\lambda$ that we are considering) either predominantly from $\mathcal{E}$ or predominantly from $\mathcal{O}$. If we are given the information that a particular vertex ($v \in \mathcal{E}$, say) is in $I$, then that should make it very likely that $I$ is drawn mostly from $\mathcal{E}$. So if we then ask what is the probability that another vertex ($w$, say) is in $I$, the answer should depend on the parity of $w$, being quite small if $w \in \mathcal{O}$ and reasonably large if $w \in \mathcal{E}$. This heuristic can be made rigorous.

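Theorem 1.7 Let $\lambda$ satisfy $\lambda \ge \frac{c\log d}{d^{1/3}}$. Let $I$ be chosen from $\mathcal{I}(Q_d)$ according to ${\rm hc}(\lambda)$. If $u, v \in \mathcal{E}$ and $w \in \mathcal{O}$ are three vertices in $Q_d$ then

$$\Pr(w \in I \mid v \in I) \le (1+\lambda)^{-d(1-o(1))} \qquad (12)$$

and

$$\Pr(u \in I \mid v \in I) \ge \frac{\lambda}{1+\lambda}(1 - o(1)). \qquad (13)$$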

Estimates of $Z_\lambda(Q_d)$ can also be used to obtain information on the number of independent sets of $Q_d$ of a given size; this topic will be explored in detail in a subsequent paper [4].

Very few specific properties of $Q_d$ are used in the sequel. We just use the fact that it is a regular bipartite graph which satisfies certain isoperimetric bounds (specifically, those of Lemmas 3.8 and 3.9). Analogues of all of our main theorems could be obtained for any family of regular bipartite graphs with appropriate isoperimetric properties, but in the absence of an appealing general formulation, we confine ourselves here to considering $Q_d$.

An overview of our approach is given in Section 2. The main technical lemma (Lemma 3.10) is stated in Section 3, along with notation and other useful lemmas. The proofs of all the stated theorems appear in Section 4, and a proof of Lemma 3.10 is presented in Section 5.

2 Overview 

A trivial lower bound on $Z_\lambda(Q_d)$ for all $\lambda > 0$ is $2(1+\lambda)^{2^{d-1}} - 1$: just consider the contribution from those sets which are drawn either entirely from $\mathcal{E}$ or entirely from $\mathcal{O}$. To improve this to the lower bounds appearing in Theorem 1.6, we consider not just independent sets which are confined purely to either $\mathcal{E}$ or $\mathcal{O}$. It is easy to see that there is a contribution of

$$2^{d-1}\lambda(1+\lambda)^{2^{d-1}-d} = (1+\lambda)^{2^{d-1}}\,\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$$

from those independent sets that have just one vertex from $\mathcal{O}$ (and the same from those that have just one vertex from $\mathcal{E}$), and more generally a contribution of approximately

$$2(1+\lambda)^{2^{d-1}}\,\frac{1}{k!}\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)^k$$

from those independent sets which consist of exactly $k$ non-nearby vertices on one side of the bipartition, for reasonably small $k$ (by "non-nearby" it is meant that there are no common neighbours between pairs of the vertices). Indeed, there are 2 ways to choose the bipartition class that has $k$ vertices, and approximately $\binom{2^{d-1}}{k} \approx 2^{(d-1)k}/k!$ ways to choose the $k$ vertices. These vertices together have a neighbourhood of size $kd$, so the sum of the weights of independent sets that extend the $k$ vertices is $\lambda^k(1+\lambda)^{2^{d-1}-kd}$. Summing over $k$ we get a lower bound on $Z_\lambda(Q_d)$ of approximately

$$2(1+\lambda)^{2^{d-1}} \exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right\}.$$
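
The bookkeeping in this heuristic can be checked exactly for small $d$. The sketch below is an illustration under our own encoding (vertices as integers, parity by bit count; the function name is hypothetical): it groups the weight of independent sets by the number $k$ of vertices they contain from $\mathcal{O}$, using the fact that once $A = I \cap \mathcal{O}$ is fixed the remaining weight is $(1+\lambda)^{2^{d-1} - |N(A)|}$.

```python
from itertools import combinations

def weights_by_odd_count(d, lam):
    """Map k -> total weight of independent sets I of Q_d with |I & O| = k."""
    n = 1 << d
    odd = [v for v in range(n) if bin(v).count("1") % 2 == 1]
    n_even = n - len(odd)
    out = {}
    for k in range(len(odd) + 1):
        total = 0
        for A in combinations(odd, k):
            # Neighbourhood of A: flip one coordinate of some vertex of A.
            nbrs = {v ^ (1 << i) for v in A for i in range(d)}
            # Any subset of the even vertices outside N(A) extends A.
            total += lam ** k * (1 + lam) ** (n_even - len(nbrs))
        out[k] = total
    return out

if __name__ == "__main__":
    w = weights_by_odd_count(4, 1)
    print(w[0], w[1], sum(w.values()))
```

For $d = 4$ and $\lambda = 1$ the $k = 1$ term equals $2^{d-1}\lambda(1+\lambda)^{2^{d-1}-d} = 128$, matching the displayed one-vertex contribution, and the terms sum to the full partition function.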



This lower bound could have also been achieved by summing only from $k$ a little below to a little above $\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$ (where the mass of the Taylor series of $\exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right\}$ is concentrated) and, once $k$ vertices have been chosen from one side, only considering extensions to the other side which have close to $\frac{\lambda 2^{d-1}}{1+\lambda}$ vertices (where the mass of the binomial series $(1+\lambda)^{2^{d-1}}$ is concentrated). This does not cause the count of extensions to drop much below $(1+\lambda)^{2^{d-1}-dk}$ as long as $dk$ is much smaller than $2^{d-1}$, which it will be for $k \approx \frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$ and $\lambda > \frac{c\log d}{d^{1/3}}$. In this way we see that the lower bound on $Z_\lambda(Q_d)$ can be achieved by only considering independent sets $I$ with $\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} \approx \frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$ and $\max\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} \approx \frac{\lambda 2^{d-1}}{1+\lambda}$. Thus an upper bound that matches the lower bound completes the proofs of both Theorems 1.6 and 1.1 (as well as Theorem 1.4, as we shall see).
To motivate the upper bound, consider what happens when we count the contribution from independent sets that have exactly two nearby vertices from $\mathcal{O}$ (that is, two vertices with a common neighbour). There are approximately $d^2 2^{d-2}$ choices for this pair (as opposed to approximately $2^{2d-2}$ choices for a pair of vertices without a common neighbour), since once the first vertex has been chosen the second must come from the approximately $d^2/2$ vertices at distance two from the first. The sum of the weights of independent sets that extend each choice is $\lambda^2(1+\lambda)^{2^{d-1}-2d+2}$, roughly the same as the sum of the weights of extensions in the case of the pair of vertices without a common neighbour. The key point here is that any pair of vertices from $\mathcal{O}$ has at most two neighbours in common, so has neighbourhood size of approximately $2d$, whether or not the vertices are nearby. Thus we get an additional contribution of approximately

$$(1+\lambda)^{2^{d-1}}\,\frac{d^2 2^{d-2}\lambda^2(1+\lambda)^2}{(1+\lambda)^{2d}}$$

to the partition function from those sets with two nearby vertices from $\mathcal{O}$, negligible compared to the additional contribution to the partition function from those sets with two non-nearby vertices from $\mathcal{O}$.

The main work in upper bounding $Z_\lambda(Q_d)$ involves extending this to the observation that the only non-negligible contribution to the partition function comes from independent sets that on one side consist of a set of vertices with non-overlapping neighbourhoods. This in turn amounts to showing that there is a negligible contribution from those independent sets which are 2-linked on one side. This entails proving a technical lemma (Lemma 3.10) bounding the sum of the weights of 2-linked subsets of $\mathcal{E}$ of a given size whose neighbourhood in $\mathcal{O}$ is of a given size. This lemma is a weighted generalization of an enumeration result originally introduced by Sapozhenko in [11] and used in [12] to simplify the original proof of Theorem 1.5. A weaker form of Lemma 3.10 is proved in [6], where it is used to estimate the weighted sum of independent sets in $Q_d$ satisfying $|I \cap \mathcal{E}| = |I \cap \mathcal{O}|$.

3 Preliminaries 

Let $\Sigma = (V, E)$ be a finite graph. For $A \subseteq V$ write $N(A)$ for the set of vertices outside $A$ that are neighbours of a vertex in $A$, and set

$$[A] = \{v \in V : N(\{v\}) \subseteq N(A)\};$$

note that if $A$ is an independent set then $A \subseteq [A]$. For bipartite $\Sigma$ with bipartition $X \cup Y$ say that $A \subseteq X$ (or $Y$) is small if $|[A]| \le |X|/2$ (or $|Y|/2$). Say that $A$ is $k$-linked if for every $u, v \in A$ there is a sequence $u = u_0, u_1, \ldots, u_l = v$ in $A$ such that for each $i = 0, \ldots, l-1$, the length of the shortest path connecting $u_i$ and $u_{i+1}$ is at most $k$, or, equivalently, if $A$ is connected in the graph obtained from $\Sigma$ by joining all pairs of vertices at distance at most $k$ (this extends our earlier notion of 2-linkedness). The following easy lemma is from [11].

Lemma 3.1 If $A$ is $k$-linked, and $T \subseteq V$ is such that for each $u \in A$ the length of the shortest path connecting $u$ to $T$ is at most $\ell$ and for each $v \in T$ the length of the shortest path connecting $v$ to $A$ is at most $\ell$, then $T$ is $(k + 2\ell)$-linked.

Note that for bipartite $\Sigma$ and $A \subseteq X$ (or $Y$), if $A$ is 2-linked then so is $[A]$. Any $A$ can be decomposed into its maximal 2-linked subsets; we refer to these as the 2-components of $A$.
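
To make the notion of 2-components concrete in the hypercube, here is a small sketch. It assumes vertices of $Q_d$ are encoded as integers and that the input set lies in a single bipartition class, so that two of its vertices are joined exactly when their Hamming distance is at most 2; the function name is ours.

```python
def two_components(A):
    """Split a set A of same-parity hypercube vertices into its 2-components."""
    remaining = set(A)
    components = []
    while remaining:
        start = remaining.pop()
        comp, stack = {start}, [start]
        while stack:
            u = stack.pop()
            # Vertices still unassigned that lie within distance 2 of u.
            linked = {v for v in remaining if bin(u ^ v).count("1") <= 2}
            remaining -= linked
            comp |= linked
            stack.extend(linked)
        components.append(comp)
    return components

if __name__ == "__main__":
    print(two_components({0b000000, 0b000011, 0b111100, 0b111111}))
```

In the usage example the four vertices of $Q_6$ split into two 2-components of size 2, since vertices from different pairs are at distance at least 4.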

The following lemma bounds the number of connected subsets of a graph; see [5, Lemma 2.1]. (The bound given in [5] is $(e\Delta)^n$, but the proof easily gives the claimed improvement.)

Lemma 3.2 Let $\Sigma$ have maximum degree $\Delta$. The number of $n$-vertex subsets of $V$ which contain a fixed vertex and induce a connected subgraph is at most $(e\Delta)^{n-1}$.

We will use the following easy corollary which follows from the fact that a $k$-linked subset of a $d$-regular graph is connected in a graph with all degrees at most $(k+1)d^k$.

Corollary 3.3 Let $\Sigma$ be $d$-regular with $d \ge 2$. The number of $k$-linked subsets of $V$ of size $n$ which contain a fixed vertex is at most $\exp\{3nk\log d\}$.

The next lemma is a special case of a fundamental result due to Lovász [10] and Stein [13]. For bipartite $\Sigma$ with bipartition $X \cup Y$, say that $Y' \subseteq Y$ covers $X$ if each $x \in X$ has a neighbour in $Y'$.

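Lemma 3.4 If $\Sigma$ as above satisfies $|N(x)| \ge a$ for each $x \in X$ and $|N(y)| \le b$ for each $y \in Y$ then there is some $Y' \subseteq Y$ that covers $X$ and satisfies

$$|Y'| \le (|Y|/a)(1 + \log b).$$

The following is a special case of Hoeffding's Inequality [7].

Lemma 3.5 For all $\lambda > 0$, $\delta > 0$ and $m \in \mathbb{N}$,

$$\sum_{j \,:\, \left|j - \frac{\lambda m}{1+\lambda}\right| \le \delta m} \lambda^j \binom{m}{j} \ge \left(1 - 2\exp\{-2\delta^2 m\}\right)(1+\lambda)^m.$$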
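
A quick numerical sanity check of Lemma 3.5 is sketched below. It reads the lemma as a statement about the terms $\lambda^j\binom{m}{j}$ with $j$ within $\delta m$ of $\frac{\lambda m}{1+\lambda}$; the rounding of the summation limits and the function names are our assumptions.

```python
import math

def window_weight(m, lam, delta):
    """Sum of lam^j * C(m, j) over j within delta*m of lam*m/(1+lam)."""
    p = lam / (1.0 + lam)
    lo = max(math.ceil(m * (p - delta)), 0)
    hi = min(math.floor(m * (p + delta)), m)
    return sum(math.comb(m, j) * lam ** j for j in range(lo, hi + 1))

def hoeffding_lower_bound(m, lam, delta):
    """Right-hand side (1 - 2 exp(-2 delta^2 m)) * (1 + lam)^m."""
    return (1.0 - 2.0 * math.exp(-2.0 * delta ** 2 * m)) * (1.0 + lam) ** m

if __name__ == "__main__":
    m, lam, delta = 200, 0.5, 0.1
    print(window_weight(m, lam, delta) / (1.0 + lam) ** m)
    print(hoeffding_lower_bound(m, lam, delta) / (1.0 + lam) ** m)
```

The first printed ratio is the fraction of the binomial series captured by the window; the second is the guaranteed fraction from the lemma.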

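We will need to compare the exponential function $e^x$ to truncates $e_n(x) = \sum_{k=0}^{n} \frac{x^k}{k!}$ of its Taylor series; the following will be sufficient.

Lemma 3.6 For any $y < x < z$ with $y, z \in \mathbb{N}$,

$$e_y(x) \le \exp\left\{ y\log\left(\frac{ex}{y}\right) + \log(y+1) \right\} \qquad (14)$$

and

$$e^x - e_z(x) \le \exp\left\{ z\log\left(\frac{ex}{z}\right) + \log\left(\frac{x}{z-x}\right) \right\}. \qquad (15)$$

Proof. We have

$$e_y(x) = \sum_{k=0}^{y} \frac{x^k}{k!} \le (y+1)\frac{x^y}{y!} \le \exp\left\{ y\log\left(\frac{ex}{y}\right) + \log(y+1) \right\}$$

and

$$e^x - e_z(x) = \sum_{k=z+1}^{\infty} \frac{x^k}{k!} \le \frac{x^z}{z!}\sum_{k=1}^{\infty}\left(\frac{x}{z}\right)^k \le \exp\left\{ z\log\left(\frac{ex}{z}\right) + \log\left(\frac{x}{z-x}\right) \right\},$$

in both cases using $n! \ge (n/e)^n$. □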
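
The two truncation bounds can be checked numerically. The sketch below assumes the bounds in the form $e_y(x) \le \exp\{y\log(ex/y) + \log(y+1)\}$ and $e^x - e_z(x) \le \exp\{z\log(ex/z) + \log(x/(z-x))\}$ for $y < x < z$, as the proof indicates; the helper names are ours, and the tail is summed directly to avoid cancellation.

```python
import math

def truncated_exp(n, x):
    """e_n(x) = sum_{k=0}^{n} x^k / k!."""
    term, total = 1.0, 1.0
    for k in range(1, n + 1):
        term *= x / k
        total += term
    return total

def exp_tail(n, x, extra_terms=500):
    """Approximates e^x - e_n(x) by summing terms k = n+1, ..., n+extra_terms."""
    term = 1.0
    for k in range(1, n + 1):
        term *= x / k
    total = 0.0
    for k in range(n + 1, n + 1 + extra_terms):
        term *= x / k
        total += term
    return total

def bound_lower_truncate(y, x):
    """Right-hand side of the bound on e_y(x), valid for 0 < y < x."""
    return math.exp(y * math.log(math.e * x / y) + math.log(y + 1))

def bound_upper_tail(z, x):
    """Right-hand side of the bound on e^x - e_z(x), valid for x < z."""
    return math.exp(z * math.log(math.e * x / z) + math.log(x / (z - x)))

if __name__ == "__main__":
    print(truncated_exp(5, 10.0), bound_lower_truncate(5, 10.0))
    print(exp_tail(30, 10.0), bound_upper_tail(30, 10.0))
```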

Corollary 3.7 Let $\{x_d\}_{d=1}^{\infty}$ be such that $x_d \to \infty$. With $\varepsilon_1 = \sqrt{c_1\log x_d/x_d}$ and $\varepsilon_2 = \sqrt{c_2\log x_d/x_d}$ where $c_1 > 2$ and $c_2 > 1$ are constants, we have

$$e_{[(1+\varepsilon_2)x_d]}(x_d) - e_{[(1-\varepsilon_1)x_d]}(x_d) \sim e^{x_d}.$$

Proof. Note that the function $f(t) = (1+t)\log\left(\frac{e}{1+t}\right)$ has a local maximum at $t = 0$ and for $t = o(1)$ satisfies $f(t) = 1 - \frac{t^2}{2} + o(t^2)$. From (14) we have $e_{[(1-\varepsilon_1)x_d]}(x_d) = o(e^{x_d})$ and from (15) we have $e^{x_d} - e_{[(1+\varepsilon_2)x_d]}(x_d) = o(e^{x_d})$. □

For the remainder of this section, we specialize to $\Sigma = Q_d$. We will need the following isoperimetric bounds for $A \subseteq \mathcal{E}$ (or $\mathcal{O}$) (see [3, Lemma 6.2] for the first and [9, Lemma 1.3] for the second).

Lemma 3.8 There is a constant $C_{\rm iso} > 0$ such that for $A \subseteq \mathcal{E}$ (or $\mathcal{O}$), if $|A| \le d^4$ then $|A| \le C_{\rm iso}|N(A)|/d$. If $|A| \le d/10$, then $|N(A)| \ge d|A| - 2|A|(|A|-1)$.

Lemma 3.9 For $A \subseteq \mathcal{E}$ (or $\mathcal{O}$), if $|A| \le 2^{d-2}$ then

$$|N(A)| \ge \left(1 + \Omega(1/\sqrt{d})\right)|A|.$$

Our main tool is a weighted version of a result of Sapozhenko [11].

Lemma 3.10 For each $a, g \ge 1$, set

$$\mathcal{G}(a,g) = \{A \subseteq \mathcal{E} \text{ 2-linked} : |[A]| = a \text{ and } |N(A)| = g\}.$$

There are constants $c > 0$ and $d_0 > 0$ such that the following holds. If $\lambda > \frac{c\log d}{d^{1/3}}$ and $a \le 2^{d-2}$ then

Aeg(a, g ) ^ ' ) 

Lemma 3.10 can be proved by combining [11, Lemma 4.5] with [6, Lemmas 3.3 and 3.4] (indeed, a key result from [6] is a slightly weaker version of Lemma 3.10). A proof of Lemma 3.10 is given in Section 5. Here we establish the following corollary, which is all that we will use in the sequel.

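Corollary 3.11 For $\lambda > \frac{c\log d}{d^{1/3}}$ and $m \le d/\sqrt{\log d}$,

$$\sum \lambda^{|A|}(1+\lambda)^{-|N(A)|} \le (ed^2)^{m-1}\lambda^m(1+\lambda)^{2m(m-1)}\frac{2^d}{(1+\lambda)^{dm}}$$

where the sum is over all $A \subseteq \mathcal{E}$ small and 2-linked with $|A| \ge m$.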

Proof. We consider the sum in three parts. Say that $A$ is of type I if $|A| \le d/10$; of type II if $d/10 < |A| \le d^4$; and of type III if $d^4 < |A|$.

For type I $A$ with $|A| = k$ ($k \ge m$) there are (by Lemma 3.2) at most $2^{d-1}(ed^2)^{k-1}$ choices for $A$ (the factor of $2^{d-1}$ accounting for the choice of a fixed vertex in $A$ and the $d^2$ coming from the fact that each $A$ is connected in a graph with maximum degree at most $d^2$). By Lemma 3.8 each such $A$ satisfies $|N(A)| \ge dk - 2k(k-1)$. It follows that the contribution to the sum from type I $A$'s is at most

$$\sum_{k=m}^{d/10} 2^{d-1}(ed^2)^{k-1}\lambda^k(1+\lambda)^{-dk+2k(k-1)}.$$

For large enough $d$ (independent of $\lambda$, in the range $\lambda > \frac{c\log d}{d^{1/3}}$) each summand above is at most one third its predecessor and so the total sum is at most

$$\frac{3}{2}(ed^2)^{m-1}\lambda^m(1+\lambda)^{2m(m-1)}\frac{2^{d-1}}{(1+\lambda)^{dm}}. \qquad (16)$$

To complete the proof of the corollary we will show that the contributions to the sum from $A$'s of type II and type III are negligible compared to (16).

The contribution to the sum from type II $A$'s (again using Lemmas 3.2 and 3.8) is at most

$$\sum_{k=d/10}^{d^4} 2^{d-1}(ed^2)^{k-1}\lambda^k(1+\lambda)^{-\frac{dk}{C_{\rm iso}}}$$

(where $C_{\rm iso}$ is the constant from Lemma 3.8). For large enough $d$ (independent of $\lambda$, in the range $\lambda > \frac{c\log d}{d^{1/3}}$) the first term in this sum is the largest and so the sum is at most

$$d^4 2^{d-1}(ed^2)^{\frac{d}{10}-1}\lambda^{\frac{d}{10}}(1+\lambda)^{-\frac{d^2}{10 C_{\rm iso}}}$$

which is vanishingly small compared to (16) for all $\lambda$ and $m$ in the specified range.

In the range $|A| > d^4$ we partition the possible $A$'s according to $a := |[A]| \ge d^4$ and $g := |N(A)| \ge d^4$. By Lemma 3.10 the sum over type III $A$'s is at most

a,g>d 4 , g(a,g)^® \A&g{a,g) J a,g>d 4 ^ J 

By Lemma 3.9, $g - a \ge \Omega(d^{7/2})$ and there are at most $2^{2d}$ choices for $a$ and $g$, so the sum is at most $2^{-\omega(d)}$, which again is vanishingly small compared to (16) for all $\lambda$ and $m$ in the specified range. □
4 Proofs of the main theorems

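4.1 Proof of Theorem 1.6

We will begin with a general upper bound on $Z_\lambda(Q_d)$.

Lemma 4.1 For any $\lambda > 0$,

$$Z_\lambda(Q_d) \le 2(1+\lambda)^{2^{d-1}}\exp\left\{\sum \lambda^{|A|}(1+\lambda)^{-|N(A)|}\right\},$$

where the sum is over all $A \subseteq \mathcal{E}$ small and 2-linked with $|A| \ge 1$.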

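To see that this implies the claimed upper bounds, note that

$$\sum \lambda^{|A|}(1+\lambda)^{-|N(A)|} = \frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d + \frac{\lambda^2(1+\lambda)^2}{4}\binom{d}{2}\frac{2^d}{(1+\lambda)^{2d}} \qquad (17)$$

where the sum is over all $A \subseteq \mathcal{E}$ small and 2-linked with $1 \le |A| \le 2$. The second term on the right corresponds to $|A| = 2$: there are $2^{d-1}\binom{d}{2}/2$ ways to choose $A \subseteq \mathcal{E}$ small and 2-linked with $|A| = 2$, and each such $A$ has $|N(A)| = 2d - 2$. The first term corresponds to $|A| = 1$. On the other hand, from Corollary 3.11 we have that for all $\lambda > \frac{c\log d}{d^{1/3}}$

$$\sum \lambda^{|A|}(1+\lambda)^{-|N(A)|} \le (ed^2)^2\lambda^3(1+\lambda)^{12}\frac{2^d}{(1+\lambda)^{3d}} \le \frac{\lambda^2(1+\lambda)^2}{4}\binom{d}{2}\frac{2^d}{(1+\lambda)^{2d}}, \qquad (18)$$

where the sum is now over all $A \subseteq \mathcal{E}$ small and 2-linked with $|A| \ge 3$.

Inserting (17) and (18) into Lemma 4.1 we obtain (for $\lambda > \frac{c\log d}{d^{1/3}}$)

$$Z_\lambda(Q_d) \le 2(1+\lambda)^{2^{d-1}}\exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d + \lambda^2(1+\lambda)^2 d^2\frac{2^d}{(1+\lambda)^{2d}}\right\}. \qquad (19)$$

If $\lambda = \lambda(d)$ satisfies (1) then the exponent in (19) is $o(1)$. If $\lambda$ satisfies either (2) or (3) then it is $\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d + o(1)$. Finally, if $\lambda$ satisfies (4) then it is $\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d(1 + o(1))$. This gives all the upper bounds of Theorem 1.6.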
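Proof of Lemma 4.1. A simple argument (based on the fact that $Q_d$ has a perfect matching) shows that for $I \in \mathcal{I}(Q_d)$, at least one of $|[I \cap \mathcal{E}]| \le 2^{d-2}$, $|[I \cap \mathcal{O}]| \le 2^{d-2}$ holds. By $\mathcal{E}$-$\mathcal{O}$ symmetry we therefore have

$$Z_\lambda(Q_d) \le 2(1+\lambda)^{2^{d-1}}\sum_{A \subseteq \mathcal{E} \text{ small}} \lambda^{|A|}(1+\lambda)^{-|N(A)|}. \qquad (20)$$

Decomposing $A$ into 2-components $A_1, \ldots, A_k$, we have

$$\lambda^{|A|}(1+\lambda)^{-|N(A)|} = \prod_{i=1}^{k} \lambda^{|A_i|}(1+\lambda)^{-|N(A_i)|} \qquad (21)$$

and

$$\sum_{A \subseteq \mathcal{E} \text{ small}} \lambda^{|A|}(1+\lambda)^{-|N(A)|} = \sum_{k \ge 0} \sum\left\{ \prod_{i=1}^{k} \lambda^{|A_i|}(1+\lambda)^{-|N(A_i)|} : A \subseteq \mathcal{E} \text{ small},\ A = \cup_{i=1}^{k} A_i \right\}$$

$$\le \sum_{k \ge 0} \frac{1}{k!}\left(\sum \lambda^{|A|}(1+\lambda)^{-|N(A)|}\right)^k$$

$$= \exp\left\{\sum \lambda^{|A|}(1+\lambda)^{-|N(A)|}\right\} \qquad (22)$$

where the unqualified sum in the last two lines is over all $A \subseteq \mathcal{E}$ small and 2-linked with $|A| \ge 1$. Combining (22) with (20) we obtain the lemma. □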

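Before turning to the lower bounds, we combine (17), (18) and (22) to observe that for $\lambda > \frac{c\log d}{d^{1/3}}$ (for suitably large $c$) we have

$$\sum_{A \subseteq \mathcal{E} \text{ small}} \lambda^{|A|}(1+\lambda)^{-|N(A)|} \le \exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d + \lambda^2(1+\lambda)^2 d^2\frac{2^d}{(1+\lambda)^{2d}}\right\}. \qquad (23)$$

Now we turn to the lower bounds on $Z_\lambda(Q_d)$, which will follow from a general bound that is more than what we need for the proof of Theorem 1.6 but just what we need for much of Theorems 1.1 and 1.4.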



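Lemma 4.2 For all $\lambda > \frac{c\log d}{d^{1/3}}$ and $f \le \ell \le \frac{2^{d-2}}{d^2}$,

$$Z_\lambda(Q_d) \ge 2(1+\lambda)^{2^{d-1}}\left(\sum_{k=f}^{\ell}\frac{1}{k!}\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)^k\right)\exp\left\{-\frac{\ell^2 d^2}{2^{d-2}}\right\}\left(1 - \frac{2}{d^2}\right).$$

This lower bound is obtained by considering only $I$ which satisfy

$$f \le \min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} \le \ell$$

and

$$e_1 \le \max\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} \le e_2$$

where

$$e_1 = \frac{\lambda}{1+\lambda}\left(2^{d-1} - d\ell\right) - \sqrt{(\log d)(2^{d-1} - df)}$$

and

$$e_2 = \frac{\lambda}{1+\lambda}\left(2^{d-1} - df\right) + \sqrt{(\log d)(2^{d-1} - df)}.$$

Before proving the lemma, we use it to obtain the claimed lower bounds on $Z_\lambda(Q_d)$ and complete the proof of Theorem 1.6.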

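For $\lambda$ satisfying either (1) or (2), $\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d = O(1)$. With $f = f(\lambda) = 0$ and $\ell = \ell(\lambda) = \log d$ (say), an application of (15) yields

$$\sum_{k=f(\lambda)}^{\ell(\lambda)}\frac{1}{k!}\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)^k \ge (1 - o(1))\exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right\}$$

and we also have

$$\exp\left\{-\frac{\ell(\lambda)^2 d^2}{2^{d-2}}\right\} \ge 1 - o(1).$$

Putting these bounds into Lemma 4.2 we get

$$Z_\lambda(Q_d) \ge (2 - o(1))(1+\lambda)^{2^{d-1}}\exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right\}. \qquad (24)$$

Noting that $\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d = o(1)$ for $\lambda$ satisfying (1), we get from (24) the claimed lower bounds on $Z_\lambda(Q_d)$ for $\lambda$ satisfying either (1) or (2).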

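For $\lambda$ satisfying either (3) or (4), $\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d = \omega(1)$. For any $\varepsilon > 0$ set

$$f(\lambda) = \frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d - \sqrt{(2+\varepsilon)\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\log\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)}$$

and

$$\ell(\lambda) = \frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d + \sqrt{(2+\varepsilon)\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\log\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)}.$$

An application of Corollary 3.7 yields

$$\sum_{k=f(\lambda)}^{\ell(\lambda)}\frac{1}{k!}\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)^k \ge (1 - o(1))\exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right\}$$

and we also have

$$\exp\left\{-\frac{\ell(\lambda)^2 d^2}{2^{d-2}}\right\} \ge \exp\left\{-2d^2\lambda^2\frac{2^d}{(1+\lambda)^{2d}}\right\}.$$

Putting these bounds into Lemma 4.2 we get

$$Z_\lambda(Q_d) \ge (2 - o(1))(1+\lambda)^{2^{d-1}}\exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d - 2d^2\lambda^2\frac{2^d}{(1+\lambda)^{2d}}\right\}. \qquad (25)$$

Noting that

$$2d^2\lambda^2\frac{2^d}{(1+\lambda)^{2d}} = \begin{cases} o(1) & \text{for } \lambda \text{ satisfying (3)} \\ o\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right) & \text{for } \lambda \text{ satisfying (4)} \end{cases} \qquad (26)$$

we get from (25) the claimed lower bounds on $Z_\lambda(Q_d)$ for $\lambda$ satisfying either (3) or (4).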



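Proof of Lemma 4.2. For each $f \le k \le \ell$, we consider the contribution to the partition function from those $I$ with $|I \cap \mathcal{E}| = k$, $e_1(k) \le |I \cap \mathcal{O}| \le e_2(k)$ and all 2-components of $I \cap \mathcal{E}$ having size 1, where

$$e_1(k) := \frac{\lambda}{1+\lambda}\left(2^{d-1} - dk\right) - \sqrt{(\log d)(2^{d-1} - dk)}$$

and

$$e_2(k) := \frac{\lambda}{1+\lambda}\left(2^{d-1} - dk\right) + \sqrt{(\log d)(2^{d-1} - dk)}.$$

If we choose the elements of $I \cap \mathcal{E}$ sequentially then each new vertex we add removes from consideration at most $\binom{d}{2} + 1 \le d^2$ vertices (those vertices which are at distance at most 2 from the chosen vertex). So the number of choices for $I \cap \mathcal{E}$ is at least

$$\frac{1}{k!}\prod_{i=0}^{k-1}\left(2^{d-1} - id^2\right) \ge \frac{2^{(d-1)k}}{k!}\left(1 - \frac{kd^2}{2^{d-1}}\right)^k \ge \frac{2^{(d-1)k}}{k!}\exp\left\{-\frac{k^2 d^2}{2^{d-2}}\right\}, \qquad (27)$$

the second inequality using $1 - x \ge e^{-2x}$ for $0 \le x \le 1/2$; the application is valid since $\ell \le \frac{2^{d-2}}{d^2}$.

Once $I \cap \mathcal{E}$ has been chosen, there are $2^{d-1} - dk$ vertices in $\mathcal{O}$ from among which we choose between $e_1(k)$ and $e_2(k)$ to complete $I$. The sum of the weights of the valid extensions to $I$ is, using Lemma 3.5,

$$\lambda^k\sum_{j=e_1(k)}^{e_2(k)}\lambda^j\binom{2^{d-1} - dk}{j} \ge \lambda^k(1+\lambda)^{2^{d-1}-dk}\left(1 - \frac{2}{d^2}\right). \qquad (28)$$

Combining (27) and (28) and noting that $e_1 \le e_1(k)$ and $e_2(k) \le e_2$ for all $f \le k \le \ell$, we see that the contribution to the partition function from those $I$ with $f \le |I \cap \mathcal{E}| \le \ell$ and $e_1 \le |I \cap \mathcal{O}| \le e_2$ is at least

$$(1+\lambda)^{2^{d-1}}\left(\sum_{k=f}^{\ell}\frac{1}{k!}\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)^k\right)\exp\left\{-\frac{\ell^2 d^2}{2^{d-2}}\right\}\left(1 - \frac{2}{d^2}\right).$$

We get at least the same contribution from those $I$ with $f \le |I \cap \mathcal{O}| \le \ell$, $e_1 \le |I \cap \mathcal{E}| \le e_2$. Since $\ell < e_1$ there is no overlap between the two contributions, and all $I$ under consideration satisfy $f \le \min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} \le \ell$ and $e_1 \le \max\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} \le e_2$. This completes the proof of the lemma. □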

4.2 Proof of Theorem 1.1

The lower bounds on $Z_\lambda(Q_d)$ for $\lambda$ satisfying (1), (2) and (3) come from considering only $I$ satisfying

$$b_1(\lambda) \le \max\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} - \frac{\lambda 2^{d-1}}{1+\lambda} \le b_2(\lambda)$$

where

$$b_1(\lambda) = -\frac{\lambda}{1+\lambda}d\ell(\lambda) - \sqrt{(\log d)(2^{d-1} - df(\lambda))}$$

and

$$b_2(\lambda) = -\frac{\lambda}{1+\lambda}df(\lambda) + \sqrt{(\log d)(2^{d-1} - df(\lambda))}$$

with $f(\lambda)$ and $\ell(\lambda)$ as introduced in the discussion after the statement of Lemma 4.2. For all such $\lambda$ we have

$$b_1 \ge -2^{d/2}\sqrt{\log d} \quad \text{and} \quad b_2 \le 2^{d/2}\sqrt{\log d},$$

the main point in both cases being that for $\lambda$ satisfying (3), $d\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d = o(2^{d/2})$. Since the lower bounds in this range are asymptotic to the upper bounds, that (5) occurs a.a.s. for this range of $\lambda$ follows immediately, as does similarly the fact that (8) holds a.a.s. for $\lambda$ satisfying (3).

That (6) holds a.a.s. for $\lambda$ satisfying (1) follows immediately from Theorem 1.6. Indeed, the contribution to $Z_\lambda(Q_d)$ from those $I$ with $\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} = 0$ is

$$2(1+\lambda)^{2^{d-1}} - 1 \sim 2(1+\lambda)^{2^{d-1}} \sim Z_\lambda(Q_d).$$



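We have to work a little harder to show that (9) and (10) occur a.a.s. for $\lambda$ satisfying (4). In this range, set

$$\mathcal{I}_{\mathcal{E}}(\lambda) = \left\{ I \in \mathcal{I}(Q_d) : \begin{array}{l} {\rm cl}(I \cap \mathcal{E}) \le m \\ \frac{1}{4\log m}\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \le k(I \cap \mathcal{E}) \le em\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \\ \left|\max\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} - \frac{\lambda 2^{d-1}}{1+\lambda}\right| \le d(\log d)\left(\frac{2}{1+\lambda}\right)^d \end{array} \right\}$$

(with $m$ as in (11)) where ${\rm cl}(A)$ and $k(A)$ are the size of the largest 2-component of $A$ and the number of 2-components of $A$, respectively, and define $\mathcal{I}_{\mathcal{O}}(\lambda)$ analogously. Note that $\mathcal{I}_{\mathcal{E}}(\lambda)$ and $\mathcal{I}_{\mathcal{O}}(\lambda)$ are disjoint and that $I \in \mathcal{I}_{\mathcal{E}}(\lambda)$ satisfies (9) and (10), so the following lemma completes the proof that (9) and (10) occur a.a.s. for $\lambda$ satisfying (4).

Lemma 4.3 For $\lambda$ satisfying (4),

$$Z_\lambda(Q_d) \sim \sum_{I \in \mathcal{I}_{\mathcal{E}}(\lambda)} \lambda^{|I|} + \sum_{I \in \mathcal{I}_{\mathcal{O}}(\lambda)} \lambda^{|I|}.$$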

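Proof of Lemma 4.3. We begin by considering the contribution to $Z_\lambda(Q_d)$ from those $I$ with $I \cap \mathcal{E}$ small and ${\rm cl}(I \cap \mathcal{E}) > m$. With the sum below over such $I$, and recalling (21), we have

$$\sum \lambda^{|I|} = (1+\lambda)^{2^{d-1}}\sum_{A \subseteq \mathcal{E} \text{ small},\ {\rm cl}(A) > m} \lambda^{|A|}(1+\lambda)^{-|N(A)|}$$

$$\le (1+\lambda)^{2^{d-1}}\sum_{A' \subseteq \mathcal{E} \text{ small, 2-linked},\ |A'| > m} \lambda^{|A'|}(1+\lambda)^{-|N(A')|} \times \sum_{A'' \subseteq \mathcal{E} \text{ small}} \lambda^{|A''|}(1+\lambda)^{-|N(A'')|}$$

$$\le Z_\lambda(Q_d)\sum_{A' \subseteq \mathcal{E} \text{ small, 2-linked},\ |A'| > m} \lambda^{|A'|}(1+\lambda)^{-|N(A')|}$$

$$= o(Z_\lambda(Q_d)), \qquad (29)$$

where in (29) we have used Corollary 3.11. We similarly have a negligible contribution to $Z_\lambda(Q_d)$ from those $I$ with $I \cap \mathcal{O}$ small and ${\rm cl}(I \cap \mathcal{O}) > m$.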

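Next we consider the contribution from those $I$ with $I \cap \mathcal{E}$ small, ${\rm cl}(I \cap \mathcal{E}) \le m$ and $k(I \cap \mathcal{E}) \le \frac{1}{4\log m}\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$. The contribution is at most

$$(1+\lambda)^{2^{d-1}}\sum_{k \le \frac{1}{4\log m}\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d}\binom{2^{d-1}}{k}m^k\lambda^k(1+\lambda)^{-dk} \qquad (30)$$

$$\le (1+\lambda)^{2^{d-1}}\sum_{k \le \frac{1}{4\log m}\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d}\frac{1}{k!}\left(m\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)^k$$

$$\le (1+\lambda)^{2^{d-1}}\exp\left\{(1 - \Omega(1))\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right\} \qquad (31)$$

$$= o(Z_\lambda(Q_d)).$$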

The factor of $\binom{2^{d-1}}{k}$ in (30) counts the number of ways of choosing a fixed vertex in each of the $k$ 2-components of $I \cap \mathcal{E}$. The factor of $m^k$ counts the number of ways of assigning a size to each 2-component. For each choice of a fixed vertex and a size ($\ell_i$, say) for each 2-component, the contribution to $Z_\lambda(Q_d)$ is at most

k k 

(1 + A) 2 " 1 JJ W _1 A ft (l + A)-^ 2 ^- 1 ) < (1 + A) 2 " 1 fj A(l + A)- d 
(for large enough d, independent of A). In (1ST]) we use (TT3|) . 

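A similar calculation (using (15) in place of (14)) shows that the contribution from those $I$ with $I \cap \mathcal{E}$ small, $\mathrm{cl}(I \cap \mathcal{E}) \le m$ and $k(I \cap \mathcal{E}) > em\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$ is $o(Z_\lambda(Q_d))$, and by symmetry so too is the contribution from those $I$ with $I \cap \mathcal{O}$ small, $\mathrm{cl}(I \cap \mathcal{O}) \le m$ and either $k(I \cap \mathcal{O}) < \frac{1}{4\log m}\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$ or $k(I \cap \mathcal{O}) > em\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$.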

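We have shown that $\frac{1}{2}(1 - o(1))$ of $Z_\lambda(Q_d)$ comes from

$$\mathcal{I}'_{\mathcal{E}}(\lambda) = \left\{ I \in \mathcal{I}(Q_d) : \mathrm{cl}(I \cap \mathcal{E}) \le m,\ \frac{1}{4\log m}\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \le k(I \cap \mathcal{E}) \le em\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \right\}$$

and another $\frac{1}{2}(1 - o(1))$ comes from the analogously defined $\mathcal{I}'_{\mathcal{O}}(\lambda)$. (We have dropped "$I \cap \mathcal{E}$ small" since it is implied by the condition on $k(I \cap \mathcal{E})$.)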

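It remains to show that the contribution to $\mathcal{I}'_{\mathcal{E}}(\lambda)$ from those $I$ with $I \cap \mathcal{O}$ either too large or too small is negligible. For each $\frac{1}{4\log m}\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \le k \le em\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$ and each choice of $k$ 2-components $A_1, \ldots, A_k$ for $I \cap \mathcal{E}$, the contribution to $\sum_{I \in \mathcal{I}'_{\mathcal{E}}(\lambda)} \lambda^{|I|}$ is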



i=o 



2*"i-Xtil^(^ 



J 



By Lemma 13.51 all but a proportion at most -^ of this sum comes from those 
j satisfying 



A2 



d-l 



1 + A 



i=l \ \ i=l 



For all $k$ in the range under consideration, and all possible choices of the $A_i$'s,
we have



k / k \ 



This completes the proof. 



□ 
Finally, we turn to (7). Note that the right-hand side of (7) is

$$\Pr(\mathrm{Poisson}(\gamma_k) = c)$$

where $\mathrm{Poisson}(\gamma_k)$ is a Poisson random variable with parameter $\gamma_k := \frac{1}{2}e^{-k/2}$. For each fixed $c \in \mathbb{N}$ we get a lower bound on the contribution to the partition function from those $I$ with $\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} = c$ by considering those which have $c$ 2-components on $\mathcal{E}$, each of size 1, and have more than $\log d$ (say) vertices on $\mathcal{O}$, and the same with $\mathcal{E}$ and $\mathcal{O}$ reversed. This gives a lower bound of

i=0 \ i<\ogd v to 

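which is at least

$$(2 - o(1))(1+\lambda)^{2^{d-1}}\frac{1}{c!}\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)^c. \tag{32}$$

Recalling the discussion just before the proof of Lemma 4.3, we know that all but a vanishing part of $Z_\lambda(Q_d)$ comes from $I$ with the smaller of $I \cap \mathcal{E}$, $I \cap \mathcal{O}$ consisting of no more than $\log d$ 2-components of size 1. So we get an upper bound on the contribution to the partition function from those $I$ with $\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} = c$ of

$$2\binom{2^{d-1}}{c}\lambda^c(1+\lambda)^{2^{d-1}-cd} + o(Z_\lambda(Q_d)) \le (2+o(1))(1+\lambda)^{2^{d-1}}\frac{1}{c!}\left(\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right)^c, \tag{33}$$

the inequality following from the fact that in this range of $\lambda$,

$$Z_\lambda(Q_d) \sim 2(1+\lambda)^{2^{d-1}}\exp\left\{\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d\right\} = \Theta\left((1+\lambda)^{2^{d-1}}\right).$$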

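Combining (32) and (33), and noting that for $\lambda = 1 + \frac{k+o(1)}{d}$, $\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d \sim \gamma_k$, it follows that

$$\Pr(\min\{|I \cap \mathcal{E}|, |I \cap \mathcal{O}|\} = c) \sim \Pr(\mathrm{Poisson}(\gamma_k) = c).$$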
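The convergence of $\frac{\lambda}{2}\left(\frac{2}{1+\lambda}\right)^d$ to $\gamma_k = \frac{1}{2}e^{-k/2}$ along $\lambda = 1 + k/d$ is easy to observe numerically. The sketch below is illustrative only: it drops the $o(1)$ term in $\lambda$, and the helper names `gamma_k`, `finite_d_rate` and `poisson_pmf` are ours rather than notation from the paper.

```python
import math


def gamma_k(k):
    """Limiting Poisson parameter (1/2) * exp(-k/2)."""
    return 0.5 * math.exp(-k / 2)


def finite_d_rate(d, k):
    """(lam/2) * (2/(1+lam))^d evaluated at lam = 1 + k/d (o(1) term dropped)."""
    lam = 1 + k / d
    # (2/(1+lam))^d = (1 + k/(2d))^(-d); log1p keeps this accurate for large d
    return (lam / 2) * math.exp(-d * math.log1p(k / (2 * d)))


def poisson_pmf(c, mean):
    """Pr(Poisson(mean) = c)."""
    return math.exp(-mean) * mean ** c / math.factorial(c)


if __name__ == "__main__":
    k = 2.0
    for d in (10, 100, 10_000, 1_000_000):
        print(d, finite_d_rate(d, k), gamma_k(k))
    # Limiting law of min{|I∩E|, |I∩O|} for this k, for the first few values of c
    print([poisson_pmf(c, gamma_k(k)) for c in range(4)])
```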
4.3 Proof of Theorem 1.4 

The first statement follows immediately from the fact that (Q holds a.a.s. 
for A satisfying (JTJ). The second statement follows from our proof that (JTJ) 
holds a.a.s. for A satisfying (JTJ), once we observe that the lower bound in (JTJ) 
is obtained by only considering those independent sets for which cl(J m i n ) < 1. 
By a similar observation, our proof that (JSJ) holds a.a.s. for A satisfying (J3J) 
also proves the third statement. 

For the fourth statement, note for 2 1 / m — 1 > A > 2 1 '( m +1 ) — 1 we satisfy 
(ITT]) with m = m'; the statement then follows immediately from Lemma [4.31 



4.4 Proof of Theorem 1.7 



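Our approach is inspired by [5], in which Galvin and Kahn obtain a result of a similar flavour on the lattice $\mathbb{Z}^d$. We begin with (12). Write

$$\mathcal{J} = \{J \in \mathcal{I}(Q_d) : w \in J\} \quad \text{and} \quad \mathcal{I}' = \{I \in \mathcal{J} : u \in I\}.$$

Further, write $\mathcal{I} = \{I \in \mathcal{I}' : I \cap \mathcal{E} \text{ small}\}$. We need to show

$$\frac{w_\lambda(\mathcal{I}')}{w_\lambda(\mathcal{J})} \le (1+\lambda)^{-d(1-o(1))}, \tag{34}$$

where $w_\lambda(*) = \sum_{I \in *} \lambda^{|I|}$. We will show

$$\frac{w_\lambda(\mathcal{I})}{w_\lambda(\mathcal{J})} \le (1+\lambda)^{-d(1-o(1))}. \tag{35}$$

The same argument will show

$$\frac{w_\lambda(\{I \in \mathcal{I}' : I \cap \mathcal{O} \text{ small}\})}{w_\lambda(\{J \in \mathcal{I}(Q_d) : u \in J\})} \le (1+\lambda)^{-d(1-o(1))}.$$

Combining this with (35) we get (34), noting that for any $I \in \mathcal{I}(Q_d)$, either $I \cap \mathcal{E}$ or $I \cap \mathcal{O}$ is small, and that by symmetry $w_\lambda(\{J \in \mathcal{I}(Q_d) : u \in J\}) = w_\lambda(\mathcal{J})$.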

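We will obtain (35) by producing, for each $I \in \mathcal{I}$, a set $\varphi(I) \subseteq \mathcal{J}$, as well as a map $\nu : \mathcal{I} \times \mathcal{J} \to \mathbb{R}$ supported on pairs $(I,J)$ with $J \in \varphi(I)$ and satisfying

$$\sum_{J \in \varphi(I)} \nu(I,J) = 1 \tag{36}$$

for each $I \in \mathcal{I}$ and

$$\sum_{I \in \varphi^{-1}(J)} \lambda^{|I|-|J|}\nu(I,J) \le (1+\lambda)^{-d(1-o(1))} \tag{37}$$

for each $J \in \mathcal{J}$, where $\varphi^{-1}(J) = \{I \in \mathcal{I} : J \in \varphi(I)\}$. It is not difficult to see that the existence of such a $\varphi$ and $\nu$ satisfying (36) and (37) gives (35): by (36), $w_\lambda(\mathcal{I}) = \sum_{I \in \mathcal{I}} \lambda^{|I|} \sum_{J \in \varphi(I)} \nu(I,J) = \sum_{J \in \mathcal{J}} \lambda^{|J|} \sum_{I \in \varphi^{-1}(J)} \lambda^{|I|-|J|}\nu(I,J)$, and by (37) this is at most $(1+\lambda)^{-d(1-o(1))} w_\lambda(\mathcal{J})$.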

We produce $\varphi$ as follows. Given $I \in \mathcal{I}$, write $W(I)$ for the 2-component of $I \cap \mathcal{E}$ containing $u$. Set

$$\mathcal{W}(a,g) = \{W \subseteq \mathcal{E} : |W| = a,\ |N(W)| = g,\ u \in W,\ W \text{ small and 2-linked}\}$$

and

$$\mathcal{I}(a,g) = \{I \in \mathcal{I} : W(I) \in \mathcal{W}(a,g)\}.$$

Set $I' = I \setminus W(I)$. Note that $N(W(I)) \cap I = \emptyset$ and $N(W(I))$ is not adjacent to anything in $I'$ (if it were, then $W(I)$ would not be the 2-component of $u$ in $I \cap \mathcal{E}$). We may therefore add any subset of $N(W(I))$ to $I'$ and still have an independent set. Set

$$\varphi(I) = \{I' \cup S : S \subseteq N(W(I))\}.$$

We have just observed that indeed $\varphi(I) \subseteq \mathcal{J}$.

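For each $J \in \varphi(I)$ write $S(J)$ for $J \setminus I'$ and set

$$\nu(I,J) = \frac{\lambda^{|S(J)|}}{(1+\lambda)^{|N(W(I))|}} \left( = \frac{\lambda^{|J|-|I|+|W(I)|}}{(1+\lambda)^{|N(W(I))|}} \right).$$

Since $S(J)$ runs over all subsets of $N(W(I))$ it is clear that (36) holds. To see that (37) holds, observe that for fixed $J \in \mathcal{J}$ we have

$$
\begin{aligned}
\sum_{I \in \varphi^{-1}(J)} \lambda^{|I|-|J|}\nu(I,J) &= \sum_{I \in \varphi^{-1}(J)} \lambda^{|W(I)|}(1+\lambda)^{-|N(W(I))|} \\
&\le \sum_{a,\, g,\, W} \ \sum_{I \in \varphi^{-1}(J),\ I \in \mathcal{I}(a,g),\ W(I) = W} \lambda^a(1+\lambda)^{-g} \\
&\le \sum_{a,\, g} \ \sum_{W \in \mathcal{W}(a,g)} \lambda^a(1+\lambda)^{-g}.
\end{aligned}
\tag{38}
$$

The main point here is (38), which follows from the fact that for each $W \in \mathcal{W}(a,g)$ and $J \in \mathcal{J}$ there is at most one $I \in \mathcal{I}$ such that $I \in \mathcal{I}(a,g)$, $W(I) = W$ and $J \in \varphi(I)$.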
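The construction of $\varphi$ and $\nu$ can be exercised directly on a small cube. The sketch below is illustrative only and makes several assumptions of our own: vertices of $Q_d$ are $d$-bit integers, $\mathcal{E}$ is the set of integers with an even number of 1 bits, the "small" condition is ignored, and the function names are hypothetical. It builds $W(I)$, $I'$ and $\varphi(I)$ for a given $I$ and returns each $J \in \varphi(I)$ together with $\nu(I,J)$ as an exact fraction.

```python
from fractions import Fraction
from itertools import combinations


def neighbors(v, d):
    """Neighbours of vertex v in Q_d (flip one of the d coordinates)."""
    return {v ^ (1 << i) for i in range(d)}


def is_independent(vertices, d):
    """True if no two vertices of the given set are adjacent in Q_d."""
    return all(neighbors(v, d).isdisjoint(vertices) for v in vertices)


def two_component(side, u):
    """2-component of u inside `side`, a set of same-parity vertices.

    Two same-parity vertices are linked when they differ in exactly two bits.
    """
    comp, stack = {u}, [u]
    while stack:
        x = stack.pop()
        for y in side - comp:
            if bin(x ^ y).count("1") == 2:
                comp.add(y)
                stack.append(y)
    return comp


def phi_with_weights(I, u, d, lam):
    """Return [(J, nu(I, J))] for all J = I' ∪ S with S ⊆ N(W(I))."""
    even_part = {v for v in I if bin(v).count("1") % 2 == 0}
    W = two_component(even_part, u)
    NW = set().union(*(neighbors(v, d) for v in W))
    I_prime = set(I) - W
    lam = Fraction(lam)
    pairs = []
    for r in range(len(NW) + 1):
        for S in combinations(sorted(NW), r):
            J = frozenset(I_prime | set(S))
            pairs.append((J, lam ** r / (1 + lam) ** len(NW)))
    return pairs


if __name__ == "__main__":
    d, u, w = 4, 0, 7            # u has even weight, w has odd weight
    I = {u, w, 9}                # an independent set containing u and w
    pairs = phi_with_weights(I, u, d, Fraction(1, 2))
    print(len(pairs), sum(weight for _, weight in pairs))
```

For each $I$ the weights sum to 1 by the binomial theorem, which is (36); every $J$ produced is independent and still contains $w$, which is the observation $\varphi(I) \subseteq \mathcal{J}$.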
For each $g > d^4$ we have $\mathcal{W}(a,g) \subseteq \bigcup_{a' \le g} \mathcal{G}(a',g)$, and so, using Lemma 3.10 for (39),

£ A a (l + A)^ < £ (1 + A)-' £ A^ 

a, 9 >d 4 , W&W(a,g) g>d 4 , a'<g AeG(a',g,u) 

_ . c (g-a ) logrf 

< 2 d Yl ( 1 + A ) ^ • ( 39 ) 

g>ci 4 , a'<g 

By Lemma 3.9 we have $g - a' = \Omega(d^{7/2})$ in the range $g > d^4$ and we have at most $2^d$ choices for each of $a'$ and $g$ and so

$$\sum_{a,\ g > d^4,\ W \in \mathcal{W}(a,g)} \lambda^a(1+\lambda)^{-g} \le (1+\lambda)^{-d(1-o(1))}. \tag{40}$$

For $g \le d^4$ we have $|\mathcal{W}(a,g)| \le 2^{O(a\log d)} \le 2^{O(g\log d/d)}$ and so

J2 X a (l + \y 9 < J2 (l + A)- 9 2 0(9l0gd/d) (l + A) 0(9/l0S(i) 

a, g<d 4 , WeW(a,g) a, g<d 4 

< ^(i + A ){o(c|fe)-^(3]^xj)+4a}. 

< (l + A)- d(1 -° (1)) . (41) 

Combining (40) with (41) we obtain (35) and so (34) and (12).

We obtain (13) from (12) easily. Conditioned on $v \in I$, the probability that a particular neighbour of $w$ is in $I$ is, by (12), at most $(1+\lambda)^{-d(1-o(1))}$, and so the probability that none of the $d$ neighbours of $w$ are in $I$ is at least $1 - d(1+\lambda)^{-d(1-o(1))} = 1 - o(1)$. The probability that $w$ is in $I$ is at least the probability that it is in $I$ conditioned on none of its neighbours being in $I$ times the probability that none of its neighbours are in $I$, and so is at least $(1-o(1))\lambda/(1+\lambda)$.



5 Proof of Lemma 3.10 



There are three steps to the proof. In the first step (Lemma 5.1) we associate to each $A \in \mathcal{G}(a,g)$ a pair $(F^*, S^*)$ that approximates $A$ in the sense that $F^* \subseteq N(A)$, $S^* \supseteq [A]$ and both of $|N(A) \setminus F^*|, |S^* \setminus [A]| \le x$ hold for some suitably small $x$, and we bound the size of $\mathcal{A}_1$, the set of all pairs $(F^*, S^*)$ that arise as we run over $A \in \mathcal{G}(a,g)$ (the bound, of course, depending on $x$ as well as $a$ and $g$). This first step is the most involved of the three, and our presentation of it is based closely on Sapozhenko's original treatment [11].

The first step may be thought of as a partitioning of $\mathcal{G}(a,g)$, with the $|\mathcal{A}_1|$ many partition classes indexed by pairs $(F^*, S^*)$. The second step (Lemma 5.2) focuses on the individual partition classes: to each $(F^*, S^*)$ and $A$ in the class indexed by $(F^*, S^*)$ we associate a pair $(F, S)$ that approximates $A$ in the sense that $F \subseteq N(A)$, $S \supseteq [A]$ and $|S| \le |F| + y$ for some suitably small $y$, and we bound (uniformly in $(F^*, S^*)$) the size of $\mathcal{A}_2$, the set of all pairs $(F, S)$ that arise as we run over $A$ in the class indexed by $(F^*, S^*)$ (the bound depending on $y$). This second step essentially appears in work of Galvin and Kahn [5] (with a proof also adapted from [11]), and here we only show how the conclusion of [5, Lemma 2.17] almost immediately yields our desired conclusion.

In the third step (Lemma 5.3) we focus on a particular $(F, S)$ and bound (uniformly in $(F, S)$) the sum of the $\lambda^{|A|}$'s over all those $A \in \mathcal{G}(a,g)$ for which it holds that $F \subseteq N(A)$, $S \supseteq [A]$ and $|S| \le |F| + y$. This comes directly from work of Galvin and Tetali [6] and so we do not give the proof here.

The steps together give $|\mathcal{G}(a,g)| \le B_1|\mathcal{A}_1||\mathcal{A}_2|$ where $B_1$ is the bound from the third step. We present the steps in more generality than we need, since this adds nothing to the complexity of the proofs.

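Lemma 5.1 Let $\Sigma$ be a $d$-regular bipartite graph with bipartition classes $X$ and $Y$. Let $\mathcal{G} = \{A \subseteq X \text{ 2-linked} : |[A]| = a,\ |N(A)| = g\}$. Fix $1 \le \varphi \le d-1$. Let

$$m_\varphi = \min\{|N(K)| : y \in Y,\ K \subseteq N(y),\ |K| > \varphi\}.$$

Let $C > 0$ be any constant such that $C\log d/(\varphi d) < 1$. Set $t = g - a$. There is a family $\mathcal{A}_1 \subseteq 2^Y \times 2^X$ with

$$|\mathcal{A}_1| \le |Y|\exp\left\{\frac{78gC\log^2 d}{\varphi d} + \frac{78g\log d}{d^{Cm_\varphi/(\varphi d)}} + \frac{78t\log^2 d}{d-\varphi}\right\} \binom{\frac{3gC\log d}{\varphi}}{\le \frac{3tC\log d}{\varphi}} \binom{dg}{\le dt/(\varphi(d-\varphi))}$$

(where $\binom{n}{\le k}$ is shorthand for $\sum_{i \le k}\binom{n}{i}$) and a map $\pi_1 : \mathcal{G} \to \mathcal{A}_1$ for which $\pi_1(A) := (F^*, S^*)$ satisfies $F^* \subseteq N(A)$, $S^* \supseteq [A]$, $|N(A) \setminus F^*| \le td/(d-\varphi)$ and $|S^* \setminus [A]| \le td/(d-\varphi)$.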
Lemma 5.2 Let $\Sigma$ and $\mathcal{G}$ be as in Lemma 5.1. Let $(F^*, S^*) \in 2^Y \times 2^X$ and $x > 0$ be given. Let

$$\mathcal{G}' = \{A \in \mathcal{G} : F^* \subseteq N(A),\ S^* \supseteq [A],\ |N(A) \setminus F^*| \le x \text{ and } |S^* \setminus [A]| \le x\}.$$

There is a constant $c > 0$, a family $\mathcal{A}_2 \subseteq 2^Y \times 2^X$ with

. . . (ex ct log d 

and a map $\pi_2 : \mathcal{G}' \to \mathcal{A}_2$ for which $\pi_2(A) := (F, S)$ satisfies $F \subseteq N(A)$, $S \supseteq [A]$ and

$$|S| \le |F| + 2t\psi/(d-\psi). \tag{42}$$



Lemma 5.3 /See JQ Lemma 3.4]] Let X and Q be as in Lemma \5.1\ Let i[> 



and 7 satisfy 1 < ip < d/2 and 1 > 7 > -r^j . Fix (F, S) E 2 Y x 2 X satisfying 



g§. We have 

£ AW < max J (1 + A)^, Q _^ + J (1 + A)-* 

where the sum is over all $A \in \mathcal{G}$ satisfying $F \subseteq N(A)$ and $S \supseteq [A]$.

Before turning to the proofs, we put them together in the case $\Sigma = Q_d$. We set $\varphi = d/2$ (which choice allows us to take $x = 2t$ in Lemma 5.2) and $\psi = d^{2/3}$. By Lemma 3.8 we have $m_\varphi \ge d^2/(2C_{\mathrm{iso}})$, and (for large enough $d$) we may set $C = 2C_{\mathrm{iso}}$. Using $t \ge \Omega(g/\sqrt{d})$ (from Lemma 3.9) and the basic binomial estimate

$$\binom{n}{\le k} \le \exp\left\{(1+o(1))\,k\log\frac{n}{k}\right\} \tag{43}$$

for $k = o(n)$, the first two lemmas combine to give

ilogd 



|A||-4 2 |<2 d expjO^ d , rl 
We now take 



7 



Ml + A) - g$ffi § - 3 
log(l + A) + 31ogd - d 1 / 3 ' 
with the inequality valid for A > c d °f 3 . By our choices of ip and 7 we have 

1 - 1 T + 7* < 3g 

a d — ip 



and so 



3dg \ I / 2tip \ ( 3dg 



- exp { 3 (^r^ + 7t ) logd 



:i + a) 



(1-7)* 



with the first inequality using ( |43p and the equality following from the defi- 
nition of 7. It follows that 

maxja + Ar^^^^ii + ArA < (i + xr^ 

so that 

\9(a,g)\ < 2< ! (l + A)»exp|- 7 (log(l + A)+o(^ 

< 2 . (1 + A) » exp j_£^ 

for some $c' > 0$ (as long as $c > 0$ is suitably large), as claimed.

Proof of Lemma 5.1. Fix $A \in \mathcal{G}$ and set

$$N(A)^\varphi = \{y \in N(A) : d_{[A]}(y) > \varphi\}$$

(where for any $K \subseteq V$, $d_K(y) := |N(y) \cap K|$). We begin by describing the construction of an $F'$ which satisfies $N(A)^\varphi \subseteq F' \subseteq N(A)$ and $N(F') \supseteq [A]$. Since each vertex in $N(A) \setminus F'$ is in $N(A) \setminus N(A)^\varphi$ and so contributes at least $d - \varphi$ edges to $\nabla(N(A), X \setminus [A])$, a set of size $gd - ad = td$, such a set satisfies $|N(A) \setminus F'| \le td/(d-\varphi)$. (Here and throughout we use $\nabla(A, B)$ to indicate the set of edges with one endpoint in $A$ and the other in $B$.)
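The edge count $|\nabla(N(A), X \setminus [A])| = gd - ad = td$ used here holds because $N(A)$ sends $gd$ edges to $X$, of which exactly $ad$ go to $[A]$ (every edge leaving $[A]$ lands in $N(A)$). As an informal check, the sketch below counts these edges directly in $Q_d$; the encoding of vertices as $d$-bit integers, the choice of $X$ as the even-weight vertices, and the function names are assumptions of this illustration.

```python
def neighbors(v, d):
    """Neighbours of vertex v in Q_d (flip one of the d coordinates)."""
    return {v ^ (1 << i) for i in range(d)}


def boundary_edge_count(A, d):
    """Return (g, a, e) for a set A of even-weight vertices of Q_d.

    g = |N(A)|, a = |[A]| where [A] = {x in X : N(x) is contained in N(A)},
    and e = number of edges between N(A) and X minus [A].
    """
    X = {v for v in range(1 << d) if bin(v).count("1") % 2 == 0}
    if not set(A) <= X:
        raise ValueError("A must consist of even-weight vertices")
    N_A = set().union(*(neighbors(v, d) for v in A))
    closure = {x for x in X if neighbors(x, d) <= N_A}
    edges = [(y, x) for y in N_A for x in neighbors(y, d) if x not in closure]
    return len(N_A), len(closure), len(edges)


if __name__ == "__main__":
    d = 4
    for A in ({0}, {0, 3}, {0, 3, 5}):
        g, a, e = boundary_edge_count(A, d)
        print(sorted(A), g, a, e, (g - a) * d)
```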
Set $p = C\log d/(\varphi d)$. Construct a random subset $T_0$ of $N(A)$ by putting each $y \in N(A)$ in $T_0$ with probability $p$, these choices made independently. We have

$$\mathbf{E}(|T_0|) = gp \tag{44}$$

and since $|\nabla(N(A), X \setminus [A])| = td$,

$$\mathbf{E}(|\nabla(T_0, X \setminus [A])|) = tdp. \tag{45}$$

For $y \in N(A)^\varphi$ we have $|N(N_{[A]}(\{y\}))| \ge m_\varphi$ and so

$$
\begin{aligned}
\mathbf{E}(|N(A)^\varphi \setminus N(N_{[A]}(T_0))|) &= \sum_{y \in N(A)^\varphi} \Pr(y \notin N(N_{[A]}(T_0))) \\
&= \sum_{y \in N(A)^\varphi} \Pr(N(N_{[A]}(\{y\})) \cap T_0 = \emptyset) \\
&\le g\exp\{-pm_\varphi\}.
\end{aligned}
\tag{46}
$$

Combining (44), (45) and (46) and using Markov's inequality we find that there is at least one $T_0 \subseteq N(A)$ satisfying

$$|T_0| \le \frac{3Cg\log d}{\varphi d}, \tag{47}$$

$$|\Omega| \le \frac{3Ct\log d}{\varphi}, \tag{48}$$

where $\Omega := \nabla(T_0, X \setminus [A])$, and

$$|N(A)^\varphi \setminus N(N_{[A]}(T_0))| \le 3g\exp\{-pm_\varphi\}. \tag{49}$$

Choose one such $T_0$ and set $T_0' := N(A)^\varphi \setminus N(N_{[A]}(T_0))$. Setting $L = N(N_{[A]}(T_0)) \cup T_0'$, we have $L \supseteq N(A)^\varphi$. Let $T_1 \subseteq N(A) \setminus L$ be a cover of minimum size of $[A] \setminus N(L)$ in the graph induced by $(N(A) \setminus L) \cup ([A] \setminus N(L))$. Set $F' = L \cup T_1$. By construction, $F'$ satisfies $N(A)^\varphi \subseteq F' \subseteq N(A)$ and $N(F') \supseteq [A]$.
Before estimating how many sets $F'$ might be produced in this way as we run over $A \in \mathcal{G}$, we make some observations about the sets described above.

First, note that by Lemma 3.1 $F'$ is 4-linked ($[A]$ is 2-linked, every $x \in [A]$ is at distance 1 from $F'$ and every $y \in F'$ is at distance 1 from $[A]$) and so $T = T_0 \cup T_0' \cup T_1$ is 8-linked (every $y \in T$ is at distance 2 from $F'$ and every $y \in F'$ is at distance 2 from $T$).

Next, note that $F'$ is completely determined by the tuple $(T_0, T_0', T_1, \Omega)$, since $T_0$ and $\Omega$ together determine $N(N_{[A]}(T_0))$.

The sizes of $T_0$, $T_0'$ and $\Omega$ are bounded by (47), (49) and (48), respectively.
To bound |Ti|, note that as previously observed \N(A) \ L\ < td/(d — ip), 
d[A]\N(L)(u) < rf for each u G G\L, and dc\L(v) = d for each t> G [A] \ N(L). 
So by LemmaGLl |Ti| < (t/(d - y?))(l + logd) < St\ogd/(d- if). 

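Combining these observations, we get that $T$ is an 8-linked subset of $Y$ with

$$|T| \le \frac{3gC\log d}{\varphi d} + \frac{3g}{d^{Cm_\varphi/(\varphi d)}} + \frac{3t\log d}{d-\varphi} =: T_{\mathrm{bound}}.$$

By Corollary 3.3 there are at most $|Y|\exp\{24T_{\mathrm{bound}}\log d\}$ possible choices for $T$. Once $T$ has been chosen, there are at most $2^{T_{\mathrm{bound}}}$ choices for $T_0 \subseteq T$, at most $2^{T_{\mathrm{bound}}}$ choices for $T_1 \subseteq T$ and at most

$$\binom{\frac{3gC\log d}{\varphi}}{\le \frac{3tC\log d}{\varphi}}$$

choices for $\Omega$. So the number of choices for $(T_0, T_0', T_1, \Omega)$ is at most

$$|Y|\exp\left\{\frac{78gC\log^2 d}{\varphi d} + \frac{78g\log d}{d^{Cm_\varphi/(\varphi d)}} + \frac{78t\log^2 d}{d-\varphi}\right\} \binom{\frac{3gC\log d}{\varphi}}{\le \frac{3tC\log d}{\varphi}}. \tag{50}$$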

We now describe an algorithmic procedure which produces $F^*$ from $F'$, and also produces $S^*$ (again, for a fixed $A$). If $\{u \in [A] : d_{N(A) \setminus F'}(u) > \varphi\} \ne \emptyset$, pick the smallest (with respect to some fixed ordering of the vertices of $\Sigma$) $u$ in this set and update $F'$ by $F' \leftarrow F' \cup N(u)$. Repeat this until $\{u \in [A] : d_{N(A) \setminus F'}(u) > \varphi\} = \emptyset$. Then set $F^* = F'$ and $S^* = \{u \in X : d_{F^*}(u) \ge d - \varphi\}$.

Observe that $F^*$ thus constructed inherits the properties $F^* \subseteq N(A)$ and $|N(A) \setminus F^*| \le td/(d-\varphi)$ from $F'$, since we obtain $F^*$ from $F'$ by adding vertices of $N(A)$. We also have $S^* \supseteq [A]$, since otherwise the algorithm would not have terminated. Since each vertex of $S^* \setminus [A]$ contributes at least $d - \varphi$ edges to $\nabla(N(A), X \setminus [A])$, a set of size $td$, we have $|S^* \setminus [A]| \le td/(d-\varphi)$. Finally, the algorithm is determined by the selection of at most $dt/(\varphi(d-\varphi))$ vertices (each iteration removes at least $\varphi$ vertices from $N(A) \setminus F'$, a set of initial size at most $td/(d-\varphi)$). These vertices come from $[A]$ which is contained in $N(F')$, a set of size at most $dg$. So the total number of possibilities for $(F^*, S^*)$ for each $F'$ is at most

$$\binom{dg}{\le dt/(\varphi(d-\varphi))}.$$

Combining this with (50), we obtain the claimed bound on $|\mathcal{A}_1|$. □
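The greedy update $F' \leftarrow F' \cup N(u)$ is short enough to run by hand on a small cube. The following sketch is an illustration under assumptions of our own, not the construction used in the proof: it takes $\Sigma = Q_d$ with $X$ the even-weight vertices, orders vertices by their integer encoding, and accepts an arbitrary starting set $F' \subseteq N(A)$ rather than the $F'$ built from $T_0$, $T_0'$ and $T_1$ above (so the bound on the number of iterations is not claimed here). The name `refine` is hypothetical.

```python
def neighbors(v, d):
    """Neighbours of vertex v in Q_d (flip one of the d coordinates)."""
    return {v ^ (1 << i) for i in range(d)}


def refine(A, F_prime, d, phi):
    """Run the greedy update and return (F_star, S_star, picks).

    A        : set of even-weight vertices of Q_d
    F_prime  : any subset of N(A) (assumption of this sketch)
    phi      : the degree threshold from the text
    picks    : the vertices u selected, in order
    """
    X = [v for v in range(1 << d) if bin(v).count("1") % 2 == 0]
    N_A = set().union(*(neighbors(v, d) for v in A))
    closure = [x for x in X if neighbors(x, d) <= N_A]  # [A], in increasing order
    F = set(F_prime)
    picks = []
    while True:
        # vertices of [A] with more than phi neighbours in N(A) \ F'
        bad = [u for u in closure if len(neighbors(u, d) & (N_A - F)) > phi]
        if not bad:
            break
        u = bad[0]  # smallest such vertex in the fixed ordering
        F |= neighbors(u, d)
        picks.append(u)
    S = {x for x in X if len(neighbors(x, d) & F) >= d - phi}
    return F, S, picks


if __name__ == "__main__":
    F_star, S_star, picks = refine({0, 3}, set(), d=4, phi=1)
    print(sorted(F_star), sorted(S_star), picks)
```

On termination every $u \in [A]$ has at most $\varphi$ neighbours outside $F^*$, hence at least $d - \varphi$ inside it, which is why $S^* \supseteq [A]$.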

Proof of Lemma 5.2. An almost identical statement appears in [5, Lemma 2.17], the difference being that (42) is replaced by the two conditions $d_F(u) \ge d - \psi$ for all $u \in S$ and $d_{X \setminus S}(v) \ge d - \psi$ for all $v \in Y \setminus F$. (The proof essentially repeats the algorithmic procedure described at the end of the proof of Lemma 5.1, with $\varphi$ replaced by $\psi$.) But these two degree conditions imply (42). Indeed, observe that $|\nabla(S, G)|$ is bounded above by $d|F| + \psi|N(A) \setminus F|$ and below by $d|[A]| + (d-\psi)|S \setminus [A]| = d|S| - \psi|S \setminus [A]|$, giving

$$|S| \le |F| + \psi|(N(A) \setminus F) \cup (S \setminus [A])|/d,$$

and that each $u \in (N(A) \setminus F) \cup (S \setminus [A])$ contributes at least $d - \psi$ edges to $\nabla(N(A), X \setminus [A])$, a set of size $td$, giving

$$|(N(A) \setminus F) \cup (S \setminus [A])| \le 2td/(d-\psi).$$

These two observations together give (42). □